Rank and select revisited and extended

نویسندگان

  • Veli Mäkinen
  • Gonzalo Navarro
چکیده

The deep connection between the Burrows-Wheeler transform (BWT) and the socalled rank and select data structures for symbol sequences is the basis of most successful approaches to compressed text indexing. Rank of a symbol at a given position equals the number of times the symbol appears in the corresponding prefix of the sequence. Select is the inverse, retrieving the positions of the symbol occurrences. It has been shown that improvements to rank/select algorithms, in combination with the BWT, turn into improved compressed text indexes. This paper is devoted to alternative implementations and extensions of rank and select data structures. First, we show that one can use gap encoding techniques to obtain constant time rank and select queries in essentially the same space as what is achieved by the best current direct solution (and sometimes less). Second, we extend symbol rank and select to substring rank and select, giving several space/time tradeoffs for the problem. An application of these queries is in position-restricted substring searching, where one can specify the range in the text where the search is restricted to, and only occurrences residing in that range are to be reported. In addition, arbitrary occurrences are reported in text position order. Several byproducts of our results display connections with searchable partial sums, Chazelle’s two-dimensional data structures, and Grossi et al.’s wavelet trees. ⋆ Part of this work appeared in Proc. LATIN’06, pp. 703–714, LNCS 3887. 1 Funded by the Academy of Finland under grant 108219. 2 Funded by Millennium Nucleus Center for Web Research, Grant P04-067-F, Mideplan, Chile. Preprint submitted to Elsevier Science 31 October 2006

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Hebrew University of Jerusalem Secretary Problem of Minimizing Expected Rank: a Simple Suboptimal Approach with Generalizations

The secretary problem for selecting one item so as to minimize its expected rank, based on observing the relative ranks only, is revisited. A simple suboptimal rule, which performs almost as well as the optimal rule, is given. The rule stops with the smallest i such that Ri ≤ ic/(n + 1 − i) for a given constant c, where Ri is the relative rank of the i observation, and n is the total number of ...

متن کامل

Face Recognition Based Rank Reduction SVD Approach

Standard face recognition algorithms that use standard feature extraction techniques always suffer from image performance degradation. Recently, singular value decomposition and low-rank matrix are applied in many applications,including pattern recognition and feature extraction. The main objective of this research is to design an efficient face recognition approach by combining many tech...

متن کامل

GENERALIZED JOINT HIGHER-RANK NUMERICAL RANGE

The rank-k numerical range has a close connection to the construction of quantum error correction code for a noisy quantum channel. For noisy quantum channel, a quantum error correcting code of dimension k exists if and only if the associated joint rank-k numerical range is non-empty. In this paper the notion of joint rank-k numerical range is generalized and some statements of [2011, Generaliz...

متن کامل

Rank 3 Inhabitation of Intersection Types Revisited (Extended Version)

We revisit the undecidability result of rank 3 intersection type inhabitation (Urzyczyn 2009) in pursuit of two goals. First, we strengthen the previous result by showing that intersection type inhabitation is undecidable for types of rank 3 and order 3, i.e. it is not necessary to introduce new functional dependencies (new instructions) during proof search. Second, we pinpoint the principles n...

متن کامل

On quantum and classical Poisson algebras

Results on derivations and automorphisms of some quantum and classical Poisson algebras, as well as characterizations of manifolds by the Lie structure of such algebras, are revisited and extended. We prove in particular somehow unexpected fact that the algebras of linear differential operators acting on smooth sections of two real vector bundles of rank 1 are isomorphic as Lie algebras if and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Theor. Comput. Sci.

دوره 387  شماره 

صفحات  -

تاریخ انتشار 2007